12 research outputs found

    The pale spear-nosed bat : a neuromolecular and transgenic model for vocal learning

    Get PDF
    Funding: UK Research and Innovation (Grant Number(s): MR/T021985/1; Grant recipient(s): Sonja Vernes). Max-Planck-Gesellschaft (Grant Number(s): Max Planck Research Group ; Grant recipient(s): Sonja Vernes). Human Frontier Science Program (Grant Number(s): RGP0058/2016, RGP0058/2016; Grant recipient(s): Uwe Firzlaff, Sonja Vernes).Vocal learning, the ability to produce modified vocalizations via learning from acoustic signals, is a key trait in the evolution of speech. While extensively studied in songbirds, mammalian models for vocal learning are rare. Bats present a promising study system given their gregarious natures, small size, and the ability of some species to be maintained in captive colonies. We utilize the pale spear-nosed bat (Phyllostomus discolor) and report advances in establishing this species as a tractable model for understanding vocal learning. We have taken an interdisciplinary approach, aiming to provide an integrated understanding across genomics (Part I), neurobiology (Part II), and transgenics (Part III). In Part I, we generated new, high-quality genome annotations of coding genes and noncoding microRNAs to facilitate functional and evolutionary studies. In Part II, we traced connections between auditory-related brain regions and reported neuroimaging to explore the structure of the brain and gene expression patterns to highlight brain regions. In Part III, we created the first successful transgenic bats by manipulating the expression of FoxP2, a speech-related gene. These interdisciplinary approaches are facilitating a mechanistic and evolutionary understanding of mammalian vocal learning and can also contribute to other areas of investigation that utilize P. discolor or bats as study species.Publisher PDFPeer reviewe

    Six reference-quality genomes reveal evolution of bat adaptations

    Get PDF
    Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols to generate, to our knowledge, the first reference-quality genomes of six bat species (Rhinolophus ferrumequinum, Rousettus aegyptiacus, Phyllostomus discolor, Myotis myotis, Pipistrellus kuhlii and Molossus molossus). We integrated gene projections from our �Tool to infer Orthologs from Genome Alignments� (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease.s E.W.M. and M.P. were supported by the Max Planck Society and were partially funded by the Federal Ministry of Education and Research (grant 01IS18026C). All data produced in Dresden were funded directly by the Max Planck Society. S.C.V., P.D. and K.L. were funded by a Max Planck Research Group awarded to S.C.V. from the Max Planck Society, and a Human Frontiers Science Program (HFSP) Research grant awarded to S.C.V. (RGP0058/2016). M.H. was funded by the German Research Foundation (HI 1423/3-1) and the Max Planck Society. E.C.T. was funded by a European Research Council Research Grant (ERC2012-StG311000), UCD Wellcome Institutional Strategic Support Fund, financed jointly by University College Dublin and SFI-HRB-Wellcome Biomedical Research Partnership (ref 204844/Z/16/Z) and Irish Research Council Consolidator Laureate Award. G.M.H. was funded by a UCD Ad Astra Fellowship. G.J. and E.C.T. were funded from the Royal Society/Royal Irish Academy cost share programme. L.M.D. was supported by NSF-DEB 1442142 and 1838273, and NSF-DGE 1633299. D.A.R. was supported by NSF-DEB 1838283. E.D.J. and O.F. were funded by the Rockefeller University and the Howard Hughes Medical Institute. We thank Stony Brook Research Computing and Cyberinfrastructure, and the Institute for Advanced Computational Science at Stony Brook University for access to the high-performance SeaWulf computing system (which was made possible by a National Science Foundation grant (no. 1531492)); the Long Read Team of the DRESDEN-concept Genome Center, DFG NGS Competence Center, part of the Center for Molecular and Cellular Bioengineering (CMCB), Technische Universität Dresden; S. Kuenzel and his team of the Max Planck Institute of Evolutionary Biology; members of the Vertebrate Genomes Laboratory at The Rockefeller University for their support; L. Wiegrebe, U. Firzlaff and M. Yartsev, who gave us access to captive colonies of Phyllostomus and Rousettus bats and aided with tissue sample collection; and M. Springer, for completing the SVDquartet analyses, and providing phylogenetic input and expertise

    Comprehensive characterization of copy number variation (CNV) called from array, long- and short-read data

    Get PDF
    Background SNP arrays, short- and long-read genome sequencing are genome-wide high-throughput technologies that may be used to assay copy number variants (CNVs) in a personal genome. Each of these technologies comes with its own limitations and biases, many of which are well-known, but not all of them are thoroughly quantified. Results We assembled an ensemble of public datasets of published CNV calls and raw data for the well-studied Genome in a Bottle individual NA12878. This assembly represents a variety of methods and pipelines used for CNV calling from array, short- and long-read technologies. We then performed cross-technology comparisons regarding their ability to call CNVs. Different from other studies, we refrained from using the golden standard. Instead, we attempted to validate the CNV calls by the raw data of each technology. Conclusions Our study confirms that long-read platforms enable recalling CNVs in genomic regions inaccessible to arrays or short reads. We also found that the reproducibility of a CNV by different pipelines within each technology is strongly linked to other CNV evidence measures. Importantly, the three technologies show distinct public database frequency profiles, which differ depending on what technology the database was built on

    SeeCiTe: a method to assess CNV calls from SNP arrays using trio data

    No full text
    Motivation Single nucleotide polymorphism (SNP) genotyping arrays remain an attractive platform for assaying copy number variants (CNVs) in large population-wide cohorts. However, current tools for calling CNVs are still prone to extensive false positive calls when applied to biobank scale arrays. Moreover, there is a lack of methods exploiting cohorts with trios available (e.g. nuclear family) to assist in quality control and downstream analyses following the calling. Results We developed SeeCiTe (Seeing CNVs in Trios), a novel CNV-quality control tool that postprocesses output from current CNV-calling tools exploiting child-parent trio data to classify calls in quality categories and provide a set of visualizations for each putative CNV call in the offspring. We apply it to the Norwegian Mother, Father and Child Cohort Study (MoBa) and show that SeeCiTe improves the specificity and sensitivity compared to the common empiric filtering strategies. To our knowledge, it is the first tool that utilizes probe-level CNV data in trios (and singletons) to systematically highlight potential artifacts and visualize signal intensities in a streamlined fashion suitable for biobank scale studies

    SeeCiTe: a method to assess CNV calls from SNP arrays using trio data

    No full text
    Motivation Single nucleotide polymorphism (SNP) genotyping arrays remain an attractive platform for assaying copy number variants (CNVs) in large population-wide cohorts. However, current tools for calling CNVs are still prone to extensive false positive calls when applied to biobank scale arrays. Moreover, there is a lack of methods exploiting cohorts with trios available (e.g. nuclear family) to assist in quality control and downstream analyses following the calling. Results We developed SeeCiTe (Seeing CNVs in Trios), a novel CNV-quality control tool that postprocesses output from current CNV-calling tools exploiting child-parent trio data to classify calls in quality categories and provide a set of visualizations for each putative CNV call in the offspring. We apply it to the Norwegian Mother, Father and Child Cohort Study (MoBa) and show that SeeCiTe improves the specificity and sensitivity compared to the common empiric filtering strategies. To our knowledge, it is the first tool that utilizes probe-level CNV data in trios (and singletons) to systematically highlight potential artifacts and visualize signal intensities in a streamlined fashion suitable for biobank scale studies.publishedVersio

    Population prevalence and inheritance pattern of recurrent CNVs associated with neurodevelopmental disorders in 12,252 newborns and their parents

    Get PDF
    Recurrent copy number variations (CNVs) are common causes of neurodevelopmental disorders (NDDs) and associated with a range of psychiatric traits. These CNVs occur at defined genomic regions that are particularly prone to recurrent deletions and duplications and often exhibit variable expressivity and incomplete penetrance. Robust estimates of the population prevalence and inheritance pattern of recurrent CNVs associated with neurodevelopmental disorders (NDD CNVs) are lacking. Here we perform array-based CNV calling in 12,252 mother–father–child trios from the Norwegian Mother, Father, and Child Cohort Study (MoBa) and analyse the inheritance pattern of 26 recurrent NDD CNVs in 13 genomic regions. We estimate the total prevalence of recurrent NDD CNVs (duplications and deletions) in live-born children to 0.48% (95% C.I.: 0.37–0.62%), i.e., ~1 in 200 newborns has either a deletion or duplication in these NDDs associated regions. Approximately a third of the newborn recurrent NDD CNVs (34%, N = 20/59) are de novo variants. We provide prevalence estimates and inheritance information for each of the 26 NDD CNVs and find higher prevalence than previously reported for 1q21.1 deletions (~1:2000), 15q11.2 duplications (~1:4000), 15q13.3 microdeletions (~1:2500), 16p11.2 proximal microdeletions (~1:2000) and 17q12 deletions (~1:4000) and lower than previously reported prevalence for the 22q11.2 deletion (~1:12,000). In conclusion, our analysis of an unselected and representative population of newborns and their parents provides a clearer picture of the rate of recurrent microdeletions/duplications implicated in neurodevelopmental delay. These results will provide an important resource for genetic diagnostics and counseling

    Six reference-quality genomes reveal evolution of bat adaptations

    No full text
    Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols1 to generate, to our knowledge, the first reference-quality genomes of six bat species (Rhinolophus ferrumequinum, Rousettus aegyptiacus, Phyllostomus discolor, Myotis myotis, Pipistrellus kuhlii and Molossus molossus). We integrated gene projections from our 'Tool to infer Orthologs from Genome Alignments' (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease1.publishe
    corecore